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1 PERTUSSIS TOXI N GENE : CLONING AND EXPRESSION 

2 OF PROTECTIVE ANTIGEN 

<3 This is a continuation in part of the application serial 

4 number 07/843.727 filed March 25, 1986. 

5 The present invention is related to molecular cloning of 

6 pertussis toxin genes capable of expressing an antigen peptide 

7 having substantially reduced enzymatic activity while being 

8 protective against pertussis. More particularly, the present 

9 invention is related to bacterial plasmids pPTX42 and 
10 pPTXSl/6A encoding pertussis toxin. 

State of The Art 

11 Pertussis toxin is one of the various . toxic 

12 components produced by virulent Bordetella pertussis , the 

13 microorganism that causes whooping cough. A wide variety 

14 of biological activities such as histamine sensitization, 

15 insulin secretion, lymphocytosis promoting and immuno- 

16 potentiating effects can be attributed to this toxin. In 

17 addition to these activities, the toxin provides 

18 protection to mice when challenged intracerebrally or by 

19 aerosol. Pertussis toxin is, therefore, an important 

20 constituent in the vaccine against whooping cough and is 

21 included as "a component in such vaccines. 
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Hovever, while this toxin Is, one of the major 
protective antigens against vmooping cough, it is also , 
associated vith a variety of pathophysiological 
activities and is believed to be the major cause of 
harmful side effects associated vith the present 
pertussis vaccine. In most recipients these side effects 
are limited to local reactions, but in rare cases 

a ,i ofl (-h does occur (Baraff et al, 
neurological damage and death does o 

1979 in Third International Symposium on Pertussis. 

, v NIH-79-1830). Thus, a need to produce 
HEW publication Wo. NIH-7V loou, ^ 

a new generation of vaccine against whooping cough is 
12 evident • 

SUMMARY OF THE INVENTION 

It is, therefore, an. object of the present 
mention to clone the gene(s) responsible for expression 

16 ' of pertussis toxin. 

it is a further object of the present invention to 
folate at least a part of the pertussis toxin genome and 
determine the nucleotide sequence and genetic 
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organization thereof. 

It is yet another object of the present invention 

t0 characterize the toxin polypeptide 
cloned gene(s) . at least in terms of the ***°°*<^ 
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I Other objects and advantages of the present 

-2 invention will become evident upon a reading of the 

3 detailed description of the invention presented herein. 

BRIEF DESCRIPTION OF DRAWINGS 

5 These and other objects, features and many of the 

6 attendant advantages of the invention will be better 

7 understood upon a reading of the following detailed 

8 description when considered in connection with the 

9 accompanying drawings wherein: 

10 Fig. 1. shows SDS-electrophoresis of the products of 

II HPLC separation of pertussis toxin. Lanes 1 and 12 

12 contain 5 and 10 ug , respectively, of unf ractionated 

13 pertussis toxin. Lanes 2 through 11 contain 100 >xl 

14 aliquots of elution fractions 19 through 28, 

• -i moiArular weiehts of the subunits are 

15 respectively. The molecular weiguua 

16 indicated; 

17 Fig. 2 shows restriction map of the cloned 4.5 kb 

18 EcoRl/BamHI B, pertussis DNA fragment and genomic DNA in 

19 the region of the pertussis toxin subunit gene. (a) 

20 Restriction map of a 26 kb region of B. pertussis genomic 

21 DNA containing pertussis toxin genes. (b) Restriction 

22 map of the 4.5 kb EcoRl/BamHI insert from pPTX42 . The 

23 arrow indicates the start and translation direction of 
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1 the mature toxin subunit. The location of the Tn5 DNA 

2 insertion in mutant strains BP356 and BP357 is shown, (c) 

3 PstI fragment derived from the insert shown in panel b; 

4 Fig. 3 shows Southern blot analysis of B. pertussis 

5 genomic DNA with cloned DNA probes. (a) Total genomic 

6 DNA from strain 3779 was digested with various 

7 restriction enzymes as indicated on the figure, and 

8 analyzed by Southern blot using nick translated PstI 

9 fragment B of pPTX42 (see Fig. 2c). (b) Between 24 pg 

10 and 60 pg of genomic DNA from strains 3779, Sakairi 

11 (pertussis toxin", Tn5~) , BP347 (non-virulent, Tn5 + ) , 

12 BP349 (hemolysin", Tn5 + ) , BP353 (filamentous 

13 hemagglutinin", Tn5 + ) , Bp356 and BP357 (both pertussis 

14 toxin", TnS" 1 ") (15) (lanes 1 through 7, respectively) were 

15 digested with PstI and analyzed by Southern blot using 

16 nick translated PstI fragment B as the probe. (c) The 

17 same as panel b excet PstI fragment C was used as the 

18 probe; 

19 Fig. 4 shows the physical map and genetic 

20 organization of the Pertussis Toxin Gene. (a) 

21 Restriction map of the 4.5 kb EcoRI/BamHI fragment from 

22 pPTX42 containing the pertussis toxin gene cloned from B. 

23 pertussis strain 3779 (12) . The arrow indicates the 

24 position of the Tn5 DNA insertion in pertussis toxin 

25 negative Tn5-induced mutant strains BP356 and BP357 (24). 
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b) Open reading frames in Che forward direction. c) Open 
reading frames in the backward direction. The vertical 

3 lines indicate termination codons . d) Organizational map 

4 of the pertussis toxin gene. The arrows show the 

5 translational direction and length of the protein coding 

6 regions for the individual subunits. The hatched boxes 

7 represent the signal peptides. The solid bars in SI 

8 represent the regions homologous to the A subunits in 

9 cholera and E. coli heat labile toxins; and 

10 Fig. 5 shows the physical map of the pertussis 

11 toxin S4 subunit gene. a) Restriction map of the 4.5 

12 kilobase pair (kb) EcoRl/BamHI fragment inserted into 

13 P MC1403 . b) Detailed restriction map and sequencing 

14 strategy of the PstI fragment B containing the S4 subunit 

15 gene. Only the restriction sites used for subcloning 
prior to sequencing are shown. Closed circled arrows 
show the sequencing strategy using dideoxy chain 
termination and open circled arrows show the sequencing 
strategy using base-specific chemical cleavage. The 
arrows show the direction and the length of the sequence 
determination. The heavy black line represents the S4 
coding, region, c) Open reading frames in the three 
forward directions, d) Open reading frames in the three 
backward directions. The vertical lines indicate 
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1 DETAILED DESCRIPTION OF INVENTION 

2 The above objects and advantages of the present 

3 invention are achieved by molecular cloning of pertussis 

4 toxin genes. The cloning of the gene provides means for 

5 genetic manipulation thereof and for producing new 

6 generation of substantially pure and isolated form of 

7 antigenic peptides (toxins) for the synthesis of new 

8 generation of vaccine against pertussis. Of course, such 

9 manipulation of the pertussis toxin gene and the creation 

10 of new, manipulated toxins retaining antigenicity against 

11 pertussis but being devoid of undesirable side effects 

12 was not heretofore possible. The present invention is 

13 the first to' clone the pertussis toxin gene in an 

14 expression vector, to map its nucleotide sequence and to 

15 disclose the finger print of the polypeptide encoded by 

16 said gene(s). ' ' 

17 Any vector wherein the gene can be cloned by 

18 recombination of genetic material and which will express 

19 the cloned gene can be used, such as bacterial ( e. g. /Igtll ) , 

20 yeast (e.g. pGPD-l), viral (e.g.pGS 20 or P MM4) and the li 

21 A preferred vector is the microorganism E. coli wherein 

22 pertussis gene has been cloned in the plasmid thereof. 

23 Although any similar or equivalent methods and 

24 materials could be used in the practice or testing ^of the 
present invention, the preferred methods and materials 
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are now described. All scientific and/or technical terms 
used herein have the same meaning as generally understood 
by one of ordinary skill in the art to which the 
invention belongs. All references cited hereunder are 
incorporated herein by reference. 

MATERIALS AND METHODS 

Materials . Restriction enzymes were purchased from 
Bethesda Research Laboratories (BRL) or International 
Biotechnologies, Inc. and used under conditions 
recommended by the suppliers. T4 DNA ligase, M13mpl9 RF 
vector, isopropylthio-^-galactoside (IPTG) , '5-bromo- 
4-chloro-3-indolyl-f-D-g.alactoside (X-Gal), the 17-bp 
universal primer, Klenow fragment (Lyphozyme*) and T4 
polynucleotide kinase were purchased from BRL. Calf 
intestine phosphatase was obtained from Boehringer 
Mannheim, nucleotides from PL-Biochemicals and base 
modifying chemicals from Kodak (dime thylsulf ate, 
hydrazine and piperidine) and EM Science (formic acid). 
Plasmid pMC1403 and E. coli strain JM101 (supE, thi, 

— t 

A(lac-proAB) , [F' , traD36, proAB, lacI^ZAMlS] ) were 
obtained from Dr. Francis Nano (Rocky Mountain Laboratories, 

R 

Hamilton, Montana). Elutip-d columns came from Schleicher & 
Schuell and low melting point agarose from BRL. Radiochemicals 




* • 

-8- 

were supplied by ICN Radiochemicals (crude [f-'^lATP, 
7000 Ci/mmol) and NEN Research Products ([<- 33 5p]dGTP, 800 
Ci/mmole). B. pertussis strain 3779 was obtained from 

Dr. John J. Munoz, Rocky Mountain Lab, Hamilton, Montana. Thi 

strain is also known as 3779 BL2S4 and is commonly available. 
Purification o£ Pertussis Toxin Subunlts : 

Pertussis toxin from B. Pertussis strain 3779 was 

prepared by the method of Munoz et al > Cell Immunol. 

83:92-100, 1984, Five mg of the toxin was resuspended in 

trifluoroacetic acid and fractionated by high pressure 

liquid chromatography, HPLC, using a 1 x 25 cm Vydac C-4 

preparative column. The sample was injected in 50% 

trifluoroacetic acid and eluted at 4 ml/min over 30 min 

with a linear gradient of 25% to 100% acetonitrile 

solution containing 66% acetonitrile and 33% isopropyl 

alcohol. All solutions contained 0.1% trifluoroacetic 

acid. Elution was monitored at 220 nm and two ml 

fractions collected. Aliquots of selected fractions were" 

dried by evaporation, resuspended in gel loading buffer 

containing 2-mercaptoethanol and analyzed by sodium 

dodecylsulphate polyacrylamide gel electrophoresis, 

SDS-PAGE, on a 12% gel. 

Protein and DNA Sequencing : The polypeptide from 
HPLC fraction 21 (Fig. 1, lane 4) was sequenced using a 
Beckman 890C automated protein sequenator according to 
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1 the methods described by Howard et al, Mol. Biochem. 

2 Parasit. 12:237-246, 1984. DNA was sequenced from the 
'3 Smal site (see Fig. 2b) by the Maxam and Gilbert 

4 technique as described in Methods in Enzymol. 65:499-560, 

5 1980. 

6 Isolation of Pertussis Toxin Genes : Chromosomal 

7 DNA was prepared from B. pertussis strain 3779 following 

8 the procedure described by Hull et al, Infec. Immunol. 

9 33:933, 1981. The DNA was digested with both 

10 endonucleases EcoRI and BamHI and ligated into the same 

11 sites in the polylinker of pMC1403 as described by 
!2 Casadaban et al . J. Bacterid. 143:971-980, 1983; 

13 Maniatis et al , Molecular Cloning: A Laboratory Manual, 

14 1982. The conditions for ligation were: 60 ng of vector 

15 DNA and 40 ng of insert DNA incubated with 1.5 units of Tq. 

16 DNA ligase (BRL) and 1 mM ATP at 15° C for 20h. E. coli 

17 JM109 cells were transformed with the recombinant plasmid 

18 in accordance with the procedure of Hanahan, J. Mol. 

19 Biol. 166:557-580, 1983 and clones containing the toxin 

20 gene identified by colony hybridization at 37°C using a 

32, 

21 P-labeled 17-base mixed oligonucleotide probe 21D3 

22 following the procedure of Woods, Focus 6:1-3, 1984. The 

23 probe was synthesized on a SAM-1 DNA synthesizer 

24 (Biosearch, San Rafael, California) and consisted of the 
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1 32 possible oligonucleotides coding for 6 consecutive 

2 amino acids of the pertussis toxin subunit (Table l) . ' 

3 The probe was purified from a 20% urea-acrylamide gel and 

4 5 '-end labeled using 0.2 mCi of (gamma 32 P)ATP (ICN, 

5 crude, 7000 Ci/mmol) and 1 unit of T^ polynucleotide 

6 kinase (BRL) per 10 ul of reaction mixture in 50 mM 

7 Tris-HCl (pH 7.4) 5 mM DTT, 10 mM MgCl 2 . The labeled 

8 oligonucleotides were purified by binding to a 

9 DEAE-cellulose column (DE52, Whatman) in 10 mM Tris-HCl 

10 (pH 7.4), 1 mM EDTA (TE) and eluted with 1.0 M NaCl in 

11 TE. Ten positive clones were isolated and purified. 

12 Plasmid DNA from these clones were extracted according to 

13 the procedure of Manlatis et al . Molecular Cloning: A 

14 Laboratory Manual, 1982, digested with routine 

15 restriction endonucleases (BRL) , and then analyzed by 

16 0.8% agarose gel electrophoresis in TBE (10 mM Tris- 

17 borate pH 8.0, 1 mM EDTA). Southern blot analysis using 

18 the relabeled oligonucleotide 21D3 as the probe showed 

19 that all 10 clones contained an identical insert of B. 

20 pertussis DNA. One clone was used for further analysis 

21 by Southern blots (Fig. 3) and for DNA sequencing. 

22 Southern Blot Analyses : .Extracted DNA as described 

23 supra , was digested and separated by electrophoresis 

24 using either 0.7% or 1.2X agarose gels In 40 mM 
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1 Tris-acetace pH 8.3, 1 oM EDTA for 17 h at 30 V. The DNA 

2 was "then blotted onto nitrocellulose in 20X SSPE, sodium 

3 chloride, sodium phosphate EDTA buffer, pH 7.4, in 

4 accordance with Maniatis et al . , supra , and baked at 80° C 

5 in a vacuum oven for 2 h. Filters were prehybridized at 

o 

6 68 C for 4 h in 6X SSPE, 0.57. SDS, 5X modified Denhardt's 

7 (0.17. Ficoll 400, 0.17. bovine serum albumin, 0.17, 

8 polyvinylpyrrolidone and 0.3X SSPE) and 100 ug/ml 

9 denatured herring sperm DNA. The hybridization buffer 

10 was the same as the prehybridization buffer, except EDTA 

11 was added to a final concentration of 10 mM. PstI 

12 fragments A, B, C and D were isolated by 0.87. low-melting 

13 point agarose gel electrophoresis, purified on Elutip-d 

14 columns (Schleicher and Schuell) and nick translated 

15 (BRL) using (alpha 32 "P)CTP (800 Ci/mmol, NEN Research 

16 Products) . The nick translated probes were hybridized at 

o 

17 a concentration of about 1 uCi/ml for 48 h at 68 C. 

18 Filters were then washed in 2X SSPE and 0.'5% SDS at room 

19 (22°-25°C) temperature for 5 min, then in 2X SSPE and 

20 0.1% SDS at room temperature for 15 min, and finally in 

21 0.1X SSPE and 0.57. SDS at 68°C for 2 h. The washed 

22 filters were air dried and exposed to X-ray film using a 

23 Lightning -Plus intensifying screen following standard 

24 techniques. 
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Isolatlon and cloning of S 4 subunit o^. As 
mentioned above, purified pertussis toxin from B. 
Eertussis strain 3779 was fractionated by high pressure" 
liquid chromatography (HPLC) . One fraction (Fr21) 
. contained a polypeptide which emigrated as a major band 
with subunit S4 on SDS-PAGE (Fig. 1, l ane 4) . Although 
complete separation was not achieved, the major portion 
of the other toxin subunits were recovered in other HPLC 
fractions, i.e., S2 in Fr22, SI and S5 in Fr23, and S3 in 
Fr24 (Fig. 1). The amino acid sequence of the first 30 
NH^-terminal residues of the protein in fraction 21 was 
determined and is shown in Table 1. 
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1 Based on the protein sequence shown in Table 1, a 

r 2 mixed oligonucleotide probe representing a region of six 
• 3 consecutive amino acids with the least redundancy of the 

4 genetic code was synthesized. In this mixture of 

5 oligonucleotides, identified as probe 21D3, approximately 

6 1 out of 32 molecules corresponds to the actual DNA 

7 sequence of the pertussis toxin gene (Table 1). This 

8 mixed oligonucleotide probe was used to screen a DNA 

9 clone bank containing restriction fragments of total 

10 pertussis chromosomal DNA. The clone bank was prepared 

11 by digesting genomic DNA isolated from B . pertussis 

12 strain 3779 with both EcoRI and BamHI restriction 

13 endonucleases. The complete population of restriction 

14 fragments was ligated into the EcoRI /BamHI restriction 

15 site of expression vector pMC1403 and the recombinant 

16 plasmid used to transform E. coli JM109 cells following 

17 standard procedures well known in the art. It is noted 

18 that although E. coli is the preferred organism, other 

19 cloning vectors well known in the art, could, of course, 

20 be alternatively used. 

21 Approximately 20,000 colonies were screened by 

22 colony hybridization using the Sa -p-end labeled 

23 oligonucleotide probe 21D3. The plasmid DNA of 10 

24 positive colonies was examined by restriction enzyme and 
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1 Southern blot analyses. All 10 colonies contained a 

?r recombinant plasmid with an identical 4.5 kb EcoRI/BamHI 

3 pertussis DNA insert. One of these clones, identified as 

4 pPTX42, was selected for further characterization. A 

5 restriction map of the insert DNA was prepared and is 

6 shown in Figure 2b; Southern blot analysis indicated that 

7 the oligonucleotide probe 21D3 hybridized to only the 0.8 

8 kb Smal/PstI fragment. . 

9 A deposit of said pPTX42 clone has been made in 

10 .American Type Culture Collection, Rockville, MD under the 

11 accession No. 6704$.. This culture will continue to be 

12 maintained for at least* 30 years after a patent Issues 

13 and will be available to the public without restriction, 

14 of course, in accordance with the provisions of the law. 

15 Sequencing of the H^N- terminal region for S4 : The 

16 0.8 kb fragment was isolated by agarose gel. 

17 electrophoresis and sequenced using the Maxam and Gilbert 

18 technique, supra . The DNA sequence was translated Into 

19 an amino acid sequence and a portion of that sequence is 

20 compared in Table 1 to the NH^- terminal 30 . amino acids of 

21 the pertussis toxin subunit and the oligonucleotide probe 

22 21D3 sequence. Out of the sequence of 30 amino acid 

23 residues determined using the automated sequenator, only 
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1 2 do not correspond to the amino acid sequence deduced 

2 from "the DNA sequence, i.e., residues 24 and 26 are 

3 questionable because they repeat the amino acid in front 
4- of them and they are located near the end of the analyzed 

5 sequence. Amino acid 15 could not be determined. The 

6 rest of the deduced amino acid sequence perfectly matches 

7 the original protein sequence. The oligonucleotide probe 

8 sequence also perfectly matches the cloned DNA sequence. 

9 These results indicate that at least one of the pertussis 

10 toxin subunit genes has been cloned. 

11 Examination of the DNA sequence indicates that a 

12 precursor protein, perhaps containing a leader 

13 sequence, may exist (Table 1) . In fact, the NH^-terminal 

14 aspartic acid of the mature protein is not immediately 

15 preceded by one of the known initiation codons, i.e., 

16 ATG, GTG, TTG, or ATT, but by GCC coding for alanine, an 

17 amino acid that often occurs at the cleavage site of a 

18 signal peptide. A proline is found at amino acid 

19 position -4, which is also consistent with cleavage sites 

20 In other known sequences where this amino acid is usually 

21 present within six residues of the cleavage site. 

22 Possible translation initiation sites in the same reading 

23 frame as the mature protein and upstream of the NH^ 

24 ' -terminal aspartic acid are; ATG at position -9, TTG at 
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1 ~ 15 » and GTG at -21; however, none of these are preceded 

2 by a Shine/Dalgarno ribosomal binding site (Nature, 

3 London, 254:34-38, 1975) and only GTG at -21 is 

4 immediately followed by a basic amino acid (arginine) 

5 preceding a hydrophobic region, characteristic of 

6 bacterial signal sequences. Using the DMA sequence data 

7 and primer extension to sequence the mRNA, the actual 

8 initiation site could also be determined. 

9 Physical mapping of the S4 gene on the bacterial 

10 chromosome: The 1.3 kb PstI fragment B containing at 

11 least part of the pertussis toxin gene was used as a 

12 probe to physically map the location of this gene on the 

13 B. pertussis genome (Fig. 2). Figure 3a shows a Southern 

14 blot analysis of total B. pertussis DNA digested with a 

15 variety of six base pair-specific restriction enzymes and 

16 probed with the 1.3 kb PstI fragment B isolated from 

17 pPTX42. Each restriction digest yielded only one DNA 

18 band which hybridized with the probe. Since the 1.3 kb 

19 PstI fragment B contains a Smal site, two bands would be 

20 expected from a Smal digest of genomic DNA unless the 

21 Smal fragments were similar in size. Further, analysis 

22 indicated that the single band seen in the Smal digest is 

23 actually a doublet of two similar size DNA fragments. In , 

24 this particular gel, fragments of 1.3 kb and smaller 
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1 migrated off the gel during electrophoresis and thus 

2 couid not be detected; however, in other Southern blots 

3 in which no fragment was run off the gel, only one band 

4 was found for each restriction enzyme. These results 

5 indicate that the gene encoded by the PstI fragment B 

6 occurs only once in the genome. Using the data from 

7 these experiments and similar studies using the 1.5 kb 

8 PstI fragment A and the 0.7 kb Pstl/BamHI fragment D from 

9 the cloned 4.5 kb EcoRI/BamHI fragment, a partial 

10 restriction map of a 26 kb region of the pertussis genome 

11 as shown in Figure 2a was obtained. This method allowed 

12 to locate the first restriction site of a particular 

13 endonuclease on either side of the 4.5 kb EcoRI/BamHI 

14 fragment. This information is useful in deciphering the 

15 genetic arrangement of the toxin genes and for the 

16 cloning of larger DNA fragments of pertussis toxin. 

17 Relationship of the S4 gene and Tn5-insertions ; 

18 Weiss et al,. Infect. Immun. 42:33-41, 1983, have 

19 developed several important Tn5-induced B._ pertussis • 

20 mutants deficient in different virulence factors, i.e., 

21 pertussis toxin, hemolysin, and filamentous hemagglutinin 

22 (Infect. Immun. 43:263-269, 1984; J. Bacteriol. 

23 153:304-309, 1983). To investigate the physical 

24 relationship between the Tn5 DNA insertion and the 
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1 pertussis toxin subunit gene, genomic DNA from these 

2 mutants and strain 3779 by Southern blots using various 

3 restriction fragments of the cloned 4.5 kb EcoRl/BamHI 

4 - DNA fragment as probes were analyzed. In one set of 

5 experiments, blots of genomic PstI fragments were 

6 separately probed with cloned PstI fragments A, B, C, and 

7 D (Fig. 2c). The PstI fragments from the mutants and 

8 strain 3779 which hybridized with the cloned PstI 

9 fragments A, B, and D were exactly the same size; the 

10 blot probed with PstI fragment B is shown in Figure 3b. 

11 However, when the PstI fragment C was used as a probe, 

12 the genomic DNA from mutant strains BP356 and BP357 

13 showed a clear difference in the size of the PstI 

14 fragments that hybridized as compared to strain 3779 and 

15 the other mutant strains (Fig. 3c, lanes 6 and 7). These 

16 results indicate that this fragment contains the site of 

17 the Tn5 insertion. As expected, two labeled fragments 

18 were found, since the Tn5 DNA insert has two symmetrical 

19 PstI sites. Other Southern blots (not shown) in which 

20 genomic Bglll and Smal fragments were hybridized with the 

21 4.5 kb EcoRI/BamHI cloned probe, and the data from Figure 

22 3c, clearly show that the Tn5 DNA was inserted 1.3 kb . 

23 downstream from the start of the mature pertussis toxin 

24 S4 subunit in the two mutant strains that were . 

25 characterized as pertussis toxin negative phenotypes, . 
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1 i.e., BP356 and BP357 (Fig. 2b). This insertion is 

2 beyond the termination codon for the S4 subunit (11.7 
3" kD). Examination of these toxin negative mutants by 

4. Western blots using monoclonal antibodies for individual 

5 subunits indicate that the Tn5 DNA is not inserted in the 

6 subunit structural genes for SI or S2 (unpublished 

7 results). The pertussis toxin negative phenotype of 

8 strains BP356 and BP357 can be explained by either of two 

9 nonexclusive mechanisms. The Tn5 DNA may be inserted 

10 into the coding regions of either S3, S5, or perhaps 

11 another gene required for toxin assembly or transport. 

12 Alternatively, the Tn5 insertion could disrupt the 

13 expression of essential downstream cistrons in a 

14 polycistronic operon. Similar Southern blot analyses of 

15 genomic BamHI and EcoRI fragments indicate that none of 

16 the other virulence factor genes represented by the other 

17 Tn5-insertion mutants, are located within the lVKbregion 

18 de-fined by the first BamHI and the second EcoRI sites as 

19 shown in Figure 2a. 

20 Nucleotide Sequence 

21 Having described the identification, isolation, an 

22 construction of recombinant plasmid P PTX42, containing " 

23 pertussis toxin genes, the insert DNA from this plasmid, 

24 i.e., the 4.5 kb EcoRI / BamHI fragment shown in Fig. 4a, 
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1 was digested with various restriction enzymes and 

2 subcloned by standard procedures (Maniatis et al. f supra) 
"3 using the cloning vectors M13 mpl8 and M13 mpl9 and E. 

fb 4 coli strain JM101 as described by Me^sj-tig , Methods 

5 Enzymol. 101:20-78, 1983. Both strands of the DNA were 

6 sequenced using either the Maxam and Gilbert 

7 base-specific chemical cleavage method, supra , or the 

8 dideoxy chain termination method of Sanger et al., PNAS, 

9 74:5463-5467, 1977, with the universal 17-base primer, or 

10 both. The DNA sequence and the derived amino acid 

11 sequence were analyzed using MicroGenie* computer 

12 software. 

13 Because of the high C+G content of B. pertussis 

14 DNA, it was necessary to use both of the above mentioned 

15 methods with a combination of 8% and 20% polyacrylamide-8 

16 M urea gels for sequence analysis. Each nucleotide has 

17 been sequenced in both directions an average of 4.13 

18 times.' The final consensus sequence of the sense strand 

19 is shown in Table 2. It is noted that the sequence of 

20 the S4 subunit gene has been included in this table for 

21 completeness since this sequence lies in the middle of 

22 the structural gene sequence presented in Table . 2. The 

23 entire sequence contains about 62.2% C+G with about 19.6% 

24 A, 33.8% C, 28.4% G and 18.2% T in the sense strand, 

25 wherein A, T, C and G represent the nucleotides adenine, ' 

26 thymine, cytosine and guanine, respectively. 
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4 
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6 



Assignment of the subuni t clstrons. 

The DNA sequence shown in Table 2 was translated in 
3 all six reading frames and the reading frames are shown 

in Fig. 4b, c. The open reading frame . (ORF) corresponding 
to the S4 subunit was identified and is shown in Fig. 4d. 
The assignment of the other subunits to their respective 

7 ORFs is based on the following lines of evidence: size 

8 of ORFs , high coding probability, deduced amino acid 

9 composition, predicted molecular weights, ratios of 

10 acidic to basic amino acids, amino acid homology to other 

11 bacterial toxins, mapping of Tn5-induced mutations, and 

12 partial amino acid sequence. 

13 Significant ORFs, long enough to code for any of 
the five toxin subunits, were analyzed by the statistical 
TESTCODE algorithm designed to differentiate between real 
protein coding sequences and fortuitous open reading 

17 frames .in accordance with Fickett, Nucleic AcidiRes. 

18 10:5303," 1982. The amino acid composition of each ORF 

19 with a high protein coding probability was calculated, 

20 . starting from either the predicted amino terminus of the 

21 mature proteins or from the first amino acid for the . 

22 mature protein determined by amino acid sequencing of 

23 HPLC purified subunits. These data were then compared 

24 with the .experimentally-determined compositions of the 
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1 individual subunits as described by Tamura et al. 

2 Biochem. 21:5516, 1982. Based on the similarity of the. 

3 amino acid compositions shown in Table 3, all five 

4 subunits were identified and assigned to the ORF regions 

5 shown in Fig. 4d. Table 3 shows that the dedu'ced amino 

6 acid composition from all five assigned subunits are in 

7 good agreement with the experimentally-determined 

8 compositions of Tamura et al su£ra, with two significant 

9 exceptions. First, the SI subunit contains no lysine 

10 residues in the deduced amino acid sequence, whereas 2.2% 

11 lysine was experimentally determined. Second, in 

12 subunits S2, S3, S4, and S5 the proportion of cysteines 

13 were substantially underestimated in the experimentally 

14 observed compositions. These discrepancies, as well as 

15 the remaining minor differences observed for all 

16 subunits, including the previously assigned S4 subunit, 

17 can most reasonably be explained by experimental error 

18 during 'amino acid analysis. Similar analyses, in which a 

19 DNA-deduced amino acid composition was compared with an 

20 experiment ally- derived amino acid composition show the 

21 same minor differences. The absence of lysine residues 

22 in SI may explain why lysine-specific chemical . 

23 modification does not affect the biological and enzymatic 

24 activities of SI. The amino acid composition of the ORFs 

25 (Fig. 4b, c) not assigned to any subunit show no 
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„ nf the experimentally-determined amino 
similarity to any of the exp 
acid compositions, although some of these 
lon S and have a hi g h codin S potential. It 
thaC these regions code for other proteins, perh aps 

_l V ed in the assembly or ^^^^T ' 
The experimentally-estimated molecular 

isoelectric point of the individual -*^« J' , f 
compared to the calculated molecular weight and ~t 
ac idic to basic amino acids of the putative pro ^in 
eroded hy the O^s shown in Tig.,. - expecte f 

, ' fi.ct corresponding differences in the 
ratios reflect corre p 

isoelectric points for each subunit, i.e., the g 

, the lower the isoelectric point. The 
acidic content, the iowe 

risen of the molecular- weights also shows good 
C o«pariso the u lned values, 

correspondence to the ex, ^ 
. ^ slight durances o «*~0 ^ 

theS 5 ^ plin secular weights 

within acceptable limits ror p 

20 determined by SDS-PAGE. 
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Table 4 

Comparison of Tvo Homologous Re S ions in ADP-ribosyiacins aubunita 
" of Pertussis, Cholera, and E. coli Eeac Labile Toxins. 



Region L 



Pertussis SI subunic 

Cholera 3 " A subunic 

E. coll 3 " HLT A subunic 



(8) T7r Arg Tyr Asp Ser Arg Pro Pro (15) 
(6) Ty t Arg Ala Asp Ser Arg Pro Pro (12) 
(6) Tyr Arg Ala As? Ser Arg Pro Pro (13) 



Region 2 



Pertussis SI subunic 
Cholera 3, A subunic 
E. coli a HLT A subunic 



(51) Val Ser Thr Ser Ser Ser Arg Arg ' (53) 
(60) Val Ser Thr Ser He Ser Leu Arg (67) 
(60) Val Ser Thr Ser Leu Ser Leu Arg (67) 



The numners in pa rentheses refer Co the amino acid position in the 
macura proceins. 

^Data from Yamainoto, et al.. FEBS Letter 169; 241, 1983 
HLT = Heat Labile Toxin 
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- The assignment: for SI in the location shown in Fig. 
4d is further supported by a significant homology of two 
3 regions in the SI amino acid sequence with two related 
regions in the A subunits of both cholera and E. coli 
heat labile toxins. These homologous regions, shown in 

6 Table 4, may be part of functional domains for a 

7 catalytic activity in the subunits for all three toxins. 

8 Furthermore, the assignment for SI, as well as the 

9 correct prediction of the signal peptide cleavage site, 

10 is supported by preliminary amino acid sequence data for 

11 the mature protein (unpublished results) . 
Subunits S2 and S3 share 70% amino acid homology, 

which makes the correct assignment of these subunits to 

14 their ORFs difficult if it is based only on the amino 

15 acid composition and the molecular weight. Nevertheless,- 
16. the gene order could be determined as shown in Fig. 4d 

17 based on the location of a Tn5-induced mutation 

18 responsible for the lack of active pertussis toxin in the 
supernatant of the mutant B. pertussis strains. This Tn5 
insertion was mapped 1.3 kb downstream of the start site 

21 for the S4 subunit gene, as indicated by the arrow in 

22 Fig. Aa. As can be seen in Fig. 4, the Tn5-iosertion in 

23 those mutants would be located in the ORF for S3. 

24 Although unable to produce active pertussis toxin, the 

25 mutants are still able to produce the S2 subunit. Thus, 
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I the"Tn5-insertion in those mutants is not located in the 
•2 structural gene for S2. Therefore, the ORFs for S2 and- 

3 S3 could be differentiated. 

4 Ami no acid sequences . 

5 The amino acid sequence for each subunit was 

6 deduced from the nucleotide sequence and is shown in 

7 Table 2. The mature proteins contain 234 amino acids for 

8 SI, 199 amino acids for S2, 110 amino acids for S4, 100 

9 amino acids for S5 and 199 amino acids for S3, in the 
10 order of the gene arrangement from the 5' -end to the 

II 3 '-end. Most likely all subunits contain signal 

12 peptides, as expected for secretory proteins. The length 

13 of the putative signal peptides was estimated after 

14 analysis of the hydrophobic ity plot, the predicted 

15 secondary structure and application of von Heijne's rule 

16 • for the prediction of the most probable signal peptide 

17 cleavage site. The cleavage site for each subunit is 

18 shown in Table 2 by the asterisks. The correct 

19 prediction of the cleavage sites for S4 and SI 

20 (unpublished) was confirmed by amino terminal sequencing 

21 of the purified mature subunits. The length of the 

22 signal peptides varies from 34 residues for SI, 28 

23 residues for S3, and 27 residues for S2, to 21 residues 

24 for S4, and 20 residues for S5. All of the signal 
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1 peptides contain a positively-charged amino terminal 

2 region of variable length, followed by a sequence of ■. 

3 hydrophobic amino acids, usually in cC-helical or 

4 partially -helical, partially & -pleated conformation. A 

5 less hydrophobic carboxy- terminal region follows, usually 

6 ending in a 0 -turn conformation at the signal peptide 

7 cleavage site. All subunits except S5 follow the -1, -3 

8 rule, which positions the cleavage site after Ala-X-Ala. 

9 The amino- terminal charge for the subunit signal peptides 

10 varies between +4 for SI and +1 for S4 and S5. All 

11 . described properties correspond very well- to the general 

12 properties for bacterial signal peptides. 

13 Two different initiation codons are used for the 

14 translation of all subunits in B. pertussis,, i.e., the 

15 most frequently used ATG for SI, S2, S3 and S5, and the 

16 less frequently used GTG for S4. The codon usage (Table 

17 4) is unsuitable for efficient translation of the 

18 pertussis toxin gene in E. coli. This is reflected by 

19 the codon choice for frequently used amino acids, such a 

20 alanine, arginine, glycine, histidine, lysine, proline, 

21 serine and valine. Whether pertussis toxin is a strongl 

22 or weakly expressed protein in B. pertussis and, whether 

23 this expression is regulated by the presence of a precis 

24 relative amount of the different tRNA isoacceptors, 

25 possibly different from E. coli, remains to be 
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I established. This can be evaluated by in vitro 

I 2 translation using E. coll and ^ p ertussi s, cell free .. 

3 extracts. 

4 Closer examination of the amino acid sequence 

5 reveals the striking absence of lysines in SI. Another 

6 interesting feature is the overall relatively high amoun 

7 of cysteines as compared to E. coll proteins. Cysteines 

8 do not seem to be involved in inter-subunit links to 

9 construct the quaternary structure of the toxin, since 
10 all subunits can be easily separated by SDS-PAGE in the 

II absence of reducing agents. Most likely, the cysteines 

12 are involved in intrachain bonds, since reducing agents 

13 significantly change the electrophoretic mobility of all 

14 subunits but S4. Serines, threonines and tyrosines alsc 

15 are represented more frequently than in average E. coll 

16 proteins. The hydroxyl groups of these residues may be 

17 involved in the quaternary structure through hydrogen 

18 bonding .- 

19 Analysis of the fl anking regions. 

20 Since all pertussis toxin subunits are closely 

21 linked and probably expressed in a very precise ratio, 

22 is possible that they are arranged in a polycistronic 

23 operon. A polycistronic arrangement for the subunit 

24 cistrons also has been described for other bacterial 
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1 toxins bearing similar enzymatic functions, such as diptheria, 

2 cholera and E. coli heat labile toxins. Therefore, the flanking 

3 * reagions were analyzed for the presence of transcriptional 

4 signals. In the 5 1 flanking region, starting at position 469, 

5 the sequence TAAAATA was found, which matches six of the seven 

6 nucleotides found in the ideal TATAATA Pribnow or -10 box. An 

7 identical sequence can be fund in several other bacterial 

8 promotors, including the lambda L57 promotor. Given the fact 

9 that most transcripts start as a purine residue about 5-7 

10 nucleotides downstream from the Pribnow box, the transcrip- 

11 tional start site was tentatively located at the adenine residue 

12 at position 482. This residue is located in the sequence CAT, 

13 often found at transcriptional start sites. Upstream from the 

14 proposed -10 box, the sequence CTGACC starts at position 442. 

15 This sequence matches four of the six nucleotides fund in the 

16 ideal E. coli -35 box TTGACA . The mismatching nucleotides in 

17 the proposed pertussis toxin -35 box are the two end nucleotid- 

18 es, of which the 3 1 residue is the less important nucleotide in 

19 the E . coli -35 consensus box. A replacement of the T by a C in 
2 0 the first position of the consensus sequence can also be found 

21 in several E. coli promotors. The distance between the two pro- 

22 posed promotor boxes is 21 nucleotides, a distance of the same 
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length.has been found in the galPl promoter and in 
several plasmid promoters. The proposed -35 box is 
Mediately preceded by tvo overlapping short inverted 
repeats with calculated free energies of -15.6 kcal and 
-8.6 kcal, respectively. Inverted repeats can also be 
found at the 5' -end of the cholera toxin promoter. In 
both cases, they may be involved in positive regulation 
of the toxin promoters. None of the ORFs assigned to the 
other subunit is closely preceded by a similar 
10 promotor-like structure. However, a different 
U premotor-like structure was found associated with the S4 

12 subunit ORF. 

13 The 3' -flanking region has been examined for the 

14 presence of possible transcriptional termination sites. 

15 Several inverted repeats could be found; the most 

16 significant is located in the region extending from 
position 4031 to 4089 and has a calculated free energy of 
.41.4 kcal. None of the inverted repeats are immediately 
followed by an oligo(dT) stretch, which may suggest that 

20 they function in a rho-dependent fashion. Preliminary 

21 experiments indicate, however, that neither inverted 

22 repeat functions efficiently in £. coll (results not 

23 shown) . Whether they are functional in 3. PJ^L . 

24 remains to be established and can be investigated y 

25 . small deletion or site-directed mutagenesis experiments, 
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1 whi'ch are feasible now that the DNA sequence is known. 

2 Another possibility is that the five different subunits - 

3 may not be the only proteins encoded in the polycistronic 

4 operon and that cistrons for other peptides, possibly 

5 involved in regulation, assembly or transport, are 

6 cotranscribed. Non-structural proteins involved in the 

7 posttranslationalprocessing of cbli heat labile toxin 

8 have been proposed. However, no significantly long ORF 

9 was found at the 3' -end of the nucleotide sequence shown 
10 in Fig. 4b. If other proteins are encoded by the same 
11' polycistronic operon, their coding regions must be 

12 located further downstream. 

13 Additionally, the 5 1 -flanking region of each 

14 cistron was also examined for the presence of ribosomal 

15 binding sites. Neither the ribosomal binding sequences 

16 for B. pertussis genes, nor the 3 ! -end sequence of the 

17 16 S rRNA are known. Therefore, the flanking 

only 

18 regions 'could be compared with/ the ribosomal binding 

19 sequences of heterologous procaryotic organisms 

20 represented by the Shine-Dalgarno sequence. Preceding 

21 the SI initiation codon, the sequence GGGGAAG was found 

22 starting at position 495. This sequence' shares^ four out 

23 of seven nucleotides with the ideal Shine-Dalgarno 

24 sequence AAGGAGG. The two first mismatching nucleotides 

25 in the pertussis toxin gene woul'd not destabilize the 
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5 
6 



10 
11 

12 



1 hybridizatin to the 3 '-end of the E. coli 16 S rRNA. 

2 This putative ribosomal binding site is close enough to. 

3 - the initiation codon for SI to be functional in E. coli. 

4 Another possible Shine-Dalgarno sequence overlaps the 
first one and also matches four out of seven nucleotides 
to the consensus sequence. The mismatching nucleotides, 

7 however, have a more destabilizing effect than the ones 

8 found in the first sequence. The S2 subunit ORF is not 

9 closely preceded by a ribosomal binding sequence, which 
may suggest that S2 is translated through a mechanism not 
involving the detachment and reattachment of the ribosome 
between the coding regions for SI and S2. The short 

13 distance between the SI and S2 cistrons, and the absence 

14 of a ribosomal binding site are characteristic of this 

15 mechanism. A ribosomal binding site for S4 in the 

16 sequence CAGGGCGGC, starting at position 2066 is 

17 possible. The ORF for S5 is preceded by the sequence 

18 AAGGCG, 'starting at position 2485, which matches five out 

19 of six nucleotides in the consensus sequence AAGGAG. 

20 Finally, S3 is preceded by the sequence GGGAACAC, which 

21 is very similar to the proposed ribosomal binding site 

22 for SI, i.e., GGGAAGAC. 

23 Taken as a whole, the results described herein 

24 clearly establish the complete nucleotide sequence of all 

25 structural cistrons for pertussis toxin. The gene order, 
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1 as shown in Fig. 4, is SI, S2, S4, S5, and S3. The 

2 calculated molecular weights from the deduced sequence of 

3 the mature peptides are 26,024 for SI; 21,924 for S2; 

4 12,058 for S4; 11,013 for S5 and 21,873 for S3. Since S4 

5 is present in two copies per toxin molecule, the total 
•6 molecular weight for the holotoxin is about 104950. 

7 This is in agreement with the apparent molecular weight 

8 estimated by non-denaturing PAGE. The most striking 

9 feature of the predicted peptide sequences is the high 

10 homology between S2 and S3. The two peptides share 707, 

11 amino acid homology and 75X nucleotide homology. This 

12 suggests that both cistrons were generated through a 

13 duplication of an ancestral cistron followed by mutations 

14 which result in functionally-different peptides. The 

15 differences between S2 and S3 are scattered throughout 

16 the whole sequence and are slightly more frequent in the 

17 amino-terminal half of the peptides. Despite their high 

18 homology, also reflected in the predicted secondary 

19 structures and hydrophilicities, S2 and S3 subunits 

20 cannot substitute for each other in the 

21 functionally-active pertussis toxin. The comparison 

22 between the two subunits may be useful in localizing 

23 their functional domains in relation to their primary, 

24 secondary and tertiary structure. On the basis of the 

25 differences, S2 and S3 are divided into two domains, the 
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amino-terminal and the carboxy-terminal. Each of the 
subunits binds to a S4 subunit. This function could be. 
located in the more conserved carboxy-terminal domains of 
S2 and S3. The two resulting dimers are thought to bind 
to one S5 subunit. This function could be assigned to 
the more divergent amino-terminal domains of S2 and S3. 
Alternatively, it is possible that the dimers bind to the 
S5 subunit through S4 and that the amino-terminal domains 
of S2 and S3 are involved in some other function, 
possibly the interaction of the binding moiety (S2 
through S5) with the enzymatically-active moiety (SI). 

The enzymatically-active SI subunit was compared to 
the A subunits of other bacterial toxins. Two regions 
with significant homology to cholera and E. colt heat 
Xabile toxins were found (Table 4). They are tandemly 
located in analogous regions of all three toxins. 
However, the three amino acid differences found in-these 
regions cannot be explained by single base pair changes 
in the DNA. Furthermore, in most cases the homologous 
amino acids use quite different codons in pertussis toxin 
C0O pared to cholera and E. coll heat labile toxins. 
This, together with the fact that no other significant 
homology in the primary structure could be found and that 
the amino acid sequences of the other subunits are . 
completely different from the sequence of any other 
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I ADP^ibosylating toxin, strongly suggests that pertussis 
-2 toxin is not evolutionarily related to any. of the other. 

3 known bacterial toxins. The limited homology of Si 

4 subunit to the A subunits of cholera and E. coli heat 

5 labile toxins could be due to convergent evolution, since 

6 all three toxins contain a very similar enzymatic 

q 7 and use a relatively closely-related Acceptor 

8 substrate (Ni protein for pertussis toxin and Ns protein 

9 for cholera and E. coli heat labile toxins) . The 

10 NAD-binding site for the two enterotoxins has been • 

II identified at the carboxy- terminal region of their Al 

12 subunit. No significant homology could be found between 

13 the carboxy-terminal of the enterotoxins, nor any other 

14 NAD-binding enzymes, and the analogous region in the SI 

15 subunit. This suggests that the. NAD-binding function of 

16 the ADP-ribosylating enzymes is dependent more on the 

17 secondary or tertiary structures, than on the primary 

18 structures. It is proposed that the two enzymatically- 

19 active domains lie in different regions of the protein, 

20 one at the amino -terminal half of the- subunit for the 

21 acceptor substrate (Ni) binding and the other at the 

22 • carboxy-terminal half of the subunit for the donor 

23 substrate (NAD*) binding. 

24 The presence of a promotor-like structure upstreai 

25 of the Si" subunit cistron and possible transcriptional 
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1 termination signals downstream of the S3 subunit cistron . 

2 suggests that pertussis toxin, like many other bacterial 

3 toxins, is expressed through a polycistronic mRNA. The 
inverted repeats immediately preceding the proposed 

5 promotor may be sites for positive regulation of 

6 expression of the toxin in B. pertussis. Evidence for a 
positive regulation came through the discovery of the vir 

8 gene, the product of which is essential for the 

9 production of many virulence factors, including pertussis 
toxin. Recent evidence in our laboratory suggests that 
the proposed inverted repeats in the 3' flanking region 

12 are not very efficient in transcriptional termination in 

13 E. coll (results not shown). The termination of . 

14 transcription in B.^fe^^ may be carried out by a 

15 slightly different mlchanism than in E. coli; on the 

16 other hand, the polycistron may contain other, not yet 

17 identified, genes related to expression of functionally- 

18 active "pertussis toxin or other virulence factors. We 

19 have described a promotor-like structure preceding 

20 subunit S4 and possible termination signals following the 

21 S4 cistron. The S4 promotor-like structure is quite 

22 different from the proposed promotor at the beginning of.. 

23 SI subunit. It is part of an inverted repeat, suggesting 

24 • an iron regulation of the S4 subunit e*press^|^. 8 

25 supported by the fact that chelating agents ******* 
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I . accumulation of active pertussis toxin in cell 

% supematants. It is thus possible that pertussis toxin 

3 is expressed efficiently by two dissimilar promoters, one : 

4 (promotor 1) located in the 5' -flanking region and the 

5 other (promotor 2) located upstream of S4. Both 

6 promotors would be regulated by different mechanisms. 

7 Promotor 1 would be positively regulated, possibly by the 

8 vir gene product, and promotor 2 would be negatively 

9 regulated by the presence of iron. In optimal expression 
conditions, such as in the presence of the vir gene 

II product and in the absence of iron, the S4 subunit 

12 cistron would be transcribed twice for every n 

13 transcription of the other subunits. This is a mechaksm 

14 that would explain the stoichiometry of the pertussis 

15 toxin subunits of 1:1:1:2:1 for S1:S2:S3:S4:S5, 

16 respectively, in the biologically active holotoxin. 

17 Attempts to express the pertussis toxin gene in E. 

18 coll have been heretofore unsuccessful, although very 

19 sensitive monoclonal and polyclonal antibodies are 

20 available. This lack of expression in E. coli may reside 

21 in the fact that B. pertussis promotors are not 

22 efficiently recognized by the E. coli RNA polymerase. 

23 Analysis of the promoter-like structures of the pertussis 

24 toxin gene and their comparison to strong E. coli 

25 promotors show very significant differences, indeed, of • 
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1 which the most striking ones are the unusual distances 

2 between the proposed -35 and -10 boxes in the pertussis 

3 toxin promotors. The distance between those two boxes in 

4 strong E. coli promotors is around 17 nucleotides, 

5 whereas the distances in the two putative pertussis toxin 

6 promotors are 21 nucleotides for the polycistronic 

7 promotor and 10 nucleotides for the S4 subunit promoter. 

8 Preliminary results in our laboratory using expression 

9 vectors designed to detect heterologous expression 

10 signals which are able to function in E. coli further 

11 indicate that B. pertussis promotors may not be 

12 recognized by the E. coll expression machinery. In 

13 addition, the codon usage for pertussis toxin is 

14 extremely inefficient for translation in E. coli (Table 

15 5) . Preliminary experiments show that the insertion of a 

16 fused lac/trp promotor in the Kpnl site upstream of the 

17 pertussis toxin operon probably enhances transcription 

18 but does not produce detectable levels of pertussis toxin 

19 (unpublished results). Efficient expression in E. coli 

20 would require resynthesis of the pertussis toxin operon, 

21 respecting the optimal codon usage for E. coli. It is 

22 not known whether the coiion usage for pertussis toxin 

23 reflects the optimal codon usage for expression in B. 

24 pertussis , since no other B. pertussis gene has 

25 heretofore- been sequenced. 
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1 The cloned and sequenced pertussis toxin genes are useful 

2 for the development of an efficient and safer vaccine against 

3 whooping cough. By comparison to other toxin genes with similar 

4 bi'ochemical functions and by physical identification of the 

5 active sites either for the ADP-ribosylat ion in the SI subunit or 

6 the target cell binding in subunits S2 through S4, it is now 

7 possible to modify those sites by site-directed mutagenesis of 

8 the B. pertussis genome. These modifications could abolish the 

9 pathobiological activities of pertussis toxin without hampering 

10 its immunogenicity and pr otect ivi ty . Alternatively, knowing the 

11 DNA sequence, mapping of eventual protective epitopes is now made 

12 possible. Synthetic oligopeptides comprising those epitopes will 

13 also be useful in the development of a new generation vaccine. 

14 EXAMPLE 1 

15 The region containing amino acid residues 8 through 15 of 

16 the SI subunit (called "homology box") was chosen for site- 

17 directed mutagenesis which was accomplished by employing standard 

18 methodologies well known in the art. The specific codon changes 

19 and the resultant amino acid alterations are shown in Table 6. 

20 To effect the mutagenic alterations, oligonucleotides 

21 [Beaucage et al, Tetrahedron Lett 22, 1859, (1981)] were 

22 synthesized that incorporated a series of single-codon and 

23 double-codon substitution mutations within the homology box; in 

24 addition, a mutation was also designed that allowed for selective 

25 deletion of the homology region. Two previously described SI 
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1 expression vectors were used for construction of plasmids mutated 

2 in the homology box: pPTXSl/6A and pPTXSl/33B [Cieplak et al , 

3 Proc. Natl. Acad. Scl. U.S.A. 85. 4667 (1988)]. S1/6A is an SI 

4 analog in which the mature amino-terminal aspartyl-aspartate is 

5 replaced with me t h i ony 1 va 1 ine . Both enzymatic activity and mAb 

6 1B7 reactivity are retained in S1/6A, whereas S1/33B has neither 

7 (Cieplak, supra ) . The expression vector for each SI substitution 

8 mutant was constructed in a three-way ligation using the 

9 appropriate oligonucleotide with Acc I and Bsp Mil cohesive ends, 
10 an 1824-bp DNA fragment from pPTXSl/6A (Acc I-SstI), and a 3. se- 
ll kb DNA fragment from pPTXSl/33B (Bsp MU-Sst II). The ligation 

12 and the relatively short length of the oligonucleotides required 

13 for the substitutions was facilitated by the presence of novel 

14 Bsp Mil and Nla IV restriction sites generated in the original 

15 construction of pPTXSl/33B. Deletion of the homology box 

16 involved ligation of mung bean nuclease-blunted Acc I site to the 

17 left of the box in pPTXSl/6A. and an Nla IV site to the right of 

18 the box in S1/33B; this ligation resulted in the excision of 

19 codons for Tyr" through Pro 1 *. Vector construction and retention 

20 of the altered sites were confirmed by standard restriction 

21 analysis and partial DNA sequence analysis. 

22 The expression vector constructions were transformed into E. 

23 coll. and the mutant SI genes were expressed after temperature 

24 induction. In this expression system [Burnette et al. 

25 Bio/Technology 6. 699 (1988)]. the recombinant SI polypeptides 
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I are synthesized at high phenotypic levels (7 to 22* of total 
cell protein) and segregated into intracellular inclusions, 
inclusion bodies were recovered after cell lysis (Burnette . 
supga ) and examined by SDS-polyacrylamide gel electrophoresis 
(PAGE) [U. K . Laemmli, Nature 227, 680 (1970)] (Fig. 6A) . The 
electrophoretic profile revealed that the mutagenized SI products 
constituted the predominant protein species in each preparation 

8 and that their mobilities were very similar to that of the parent 

9 S1/6A subunit. 

10 To examine the phenotypic effects of the mutations on 

II antigenicity, the mutant SI polypeptides were assayed for their 

12 ability to react with the protective mAb 1B7 in an immunoblot 

13 format. The parent construction 6A (Table 6) and each of the 

14 single-codon substitution mutants (5-1. 4-1. 3-1. 2-2. and 1-1) 

15 retained reactivity with mAb 1B7 (Fig. 6B). In contrast, the 

16 reactivity of those mutants containing double-residue substitu- 

17 tions (8-1. 7-2. and 6-1). as well as the mutant in which the 

18 homology box had ' been" deleted (6A-1 ). was significantly 

19 diminished or abolished. 

20 The mutant SI molecules were assayed for ADP- 

21 ribosyltransferase activity by measuring the transfer of 

22 radiolabeled ADP-ribose * f rom " [adenylate-»»P]NAD to purified 

23 bovine transducing [Watkins et al. J Biol Chem . 259.- 1378 

24 (1984); Manning et al . ibid. p. 749]. a guanine nucleotide-binding 
.25 regulatory protein found N in the rod outer segment membranes 

26 [Stryer et al. *™» Cell Biol. 2. 391 (1986)]. As shown in 
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1 Table 6, each of the substitutions appeared to reduce specific 

2 ADP-ribosy 1 ! transferase activity, with the exception of mutants 5- 

3 1 and 2-2, which retained the full activity associated with the 

4 parent 6A species; 6A has approximately 60% of the ADP-riboxyl- 

5 transferase activity of authentic SI (Cieplak, supra ) . Neither 

6 mutant 4-1 nor any of the double-substitution mutants exhibited 

7 any significant transferase activity when compared to the 

8 inclusion body protein control (denoted 20A) ; this control is a 

9 polypeptide of M, 21,678, derived from a major alternative open 

10 reading frame (orf) in the SI gene and does not contain SI 

11 subuni t-related sequences. 

12 The most noteworthy SI analog produced was 4-1 (Arg°- Lys). 

13 It alone among the single-substitution mutants exhibited little 

14 or no transferase activity under the conditions used (Table 6); 

15 however, unlike the double mutants, it* retained reactivity with 

16 neutralizing mAb 1B7. 

17 The results presented herein clearly demonstrate the 

18 importance and magnitude of the critical effect exerted by 

19 substitution of Arg° on the enzymatic mechanisms of the SI 

20 subunit. It is noteworthy in this respect that when the Arg°-Lys 

21 mutation was introduced into full-length recombinant SI, it was 

22 found that transferase activity was reduced by a factor of 

23 approximately 1000. This result establishes that the 

24 substitution at residue 9 is alone sufficient to attain the 

25 striking loss in enzyme activity and that the coincidental 
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1 replacement of the two amino-terminal aspartate residues In the 

2 mature Si sequence with the Met-Val dipeptide that occurs In 

3 S1/6A is not required to achieve this reduction. 

4 m in summary, a mutant gene directing the synthesis of a 

5 mutant PTX polypeptide containing the protective epitope, but 

6 with substantially reduced enzyme activity has been produced. A 

7 safe vaccine against pertussis. in accordance with the present 

8 invention, is produced by a composition comprising immunogenic 

9 amount of the mutant PTX polypeptide in a pharmaceutical ly 
10 acceptable carrier. The term "substantially reduced" enzyme 
U activity as used herein means more than about 1000 fold less 

12 enzymatic activity or almost negligible enzyme activity compared 

13 to the normal (wild type) activity. 

14 It is understood that the examples and embodiments described 

15 herein are for illustrative purposes only and that various 

16 modifications or changes in light hereof will be suggested to 

17 persons skilled in the art and are to be included within the 

18 spirit and purview of this application and the scope of the 

19 appended claims. 
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Table 6-ADP-ribosylrransfcrasc activity of recombinant SI mutant polypeptides. Intracellular inclu- 
sions containing the recombinant subunits produced in £. coli were recovered by differential 
ccntrifugation and extracted with 8Af urea (J S). The urea extracts were adjusted to a total protein 
concentration of 0.6 mg/ml, dialyzcd against 50 mM tris-HCI (pH 8.0), and then centrifuged at 
14,000g for 30 min. The amount of recombinant product in the supernatant fractions was determined 
by quantitative densitomctric scanning of proteins separated by SDS-PAGE and stained with 
Coomassie blue. ADP-ribosyltransfcrasc activity was determined {17) with the use of 4.0 *ig of purified 
bovine transducin and 100 ng of each SI. analog. The values represent the transfer of [ 32 PJADP-ribosc 
to the a subunit of transducin, as measured by total trichloroacetic acid-precipitablc radioactivity, and 
each is given as the mean of triplicate determinations with standard deviation. The 20A product 
represents a negative control because its synthesis results in the formation of intracellular inclusions that 
lack SI -related proteins. 



Mutant 


Amino 


Codon 
change 


. ADP-ribosyl- 


desig- 
nation 


acid 


transfcrasc 


change 


activity (cpm) 


6A 


None 


None 


23,450 ± 950 


5-1 


Tyr 8 ->Phc 


TAC-+TTC 


26,361 ± 1,321 


4-1 


Arg 9 -» Lys 


CGC-* AAG 


754 ± 7 


3-1 


Asp"->G!u 


GAC-* GAA 


13,549 ± 1,596 


2-2 


Ser ll -»Gly 


TCC-+GGC 


22,319 ± 2,096 


1-1 


Arg ,J -> Lys 


CGC-» AAG 


7,393 ± 1,367 


8-1 


Tyr 8 -+ Leu 
Arg 9 -»Glu 
Arg 9 -+ Asn 


TAC-+TTG 


926 ± 205 




CGC-»GAA 




7-2 


CGC— ► AAC 


753 ± 30 




Scr ,J -»Giy 


TCC-»GGC 




6-1 


Asp" -* Pro 
Pro 14 -* Asp 


GAC— CCG 
CCG -> GAC 


x 764 ± 120 


20A 


Alternate SI orf 




839 ± 68 
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